A tight upper bound on the size of the antidictionary of a binary string
نویسندگان
چکیده
An antidictionary is a set of words that never appear in a binary string. In 2000, Crochemore et al. (2000) presented a compression algorithm of binary text using antidictionary called DCA. Their coding algorithm has been tested on the Calgary Corpus, and their experimental results show that we get compression ratios equivalent to those of most common compressors such as pkzip. Recently, an online source coding scheme based on DCA is presented to apply for compressing losslessly ECG (ElectroCardioGram) in Ota and Morita (2004). Experimental results show that their algorithm achieved 10% smaller compression ratio than LZ ones. In this article, we present au upper bound of the size of the antidictionary of a binary string. The upper bound we obtained is stronger than that in Crochemore et al. (1998). And it is tight in the sense there exists a string to attain the bound. We also proved that the antidictionary of a binary string is always smaller than or equal to that of the dictionary of the same string. Moreover, we give an algorithm to reconstruct the dictionary from the antidictionary. This article is organized as follows. Section 2 gives definitions on antidictionary with some examples. In Sections 3 and 4, we investigate the size of the antidictionary of a given string and derive a tight upper bound on its size. Section 5 presents an algorithm to reconstruct the dictionary from the antidictionary of a given string and Section 6 summarizes our results.
منابع مشابه
Sharp Upper bounds for Multiplicative Version of Degree Distance and Multiplicative Version of Gutman Index of Some Products of Graphs
In $1994,$ degree distance of a graph was introduced by Dobrynin, Kochetova and Gutman. And Gutman proposed the Gutman index of a graph in $1994.$ In this paper, we introduce the concepts of multiplicative version of degree distance and the multiplicative version of Gutman index of a graph. We find the sharp upper bound for the multiplicative version of degree distance and multiplicative ver...
متن کاملAn Upper Bound Approach for Analysis of Hydroforming of Sheet Metals
Considering a kinematical velocity admissible field, the upper bound method has beenused for predicting the amount of pressure in hydroforming of sheet metals. The effects of workhardening, friction and blank size have been considered in pressure prediction. Also the effect ofsheet thickness variation has been considered in the present work formulations. The relation betweenpressure and punch s...
متن کاملRelations Between Greedy and Bit-Optimal LZ77 Encodings
This paper investigates the size in bits of the LZ77 encoding, which is the most popular and efficient variant of the Lempel–Ziv encodings used in data compression. We prove that, for a wide natural class of variable-length encoders for LZ77 phrases, the size of the greedily constructed LZ77 encoding on constant alphabets is within a factor O( logn log log logn ) of the optimal LZ77 encoding, w...
متن کاملThe (non-)existence of perfect codes in Lucas cubes
A Fibonacci string of length $n$ is a binary string $b = b_1b_2ldots b_n$ in which for every $1 leq i < n$, $b_icdot b_{i+1} = 0$. In other words, a Fibonacci string is a binary string without 11 as a substring. Similarly, a Lucas string is a Fibonacci string $b_1b_2ldots b_n$ that $b_1cdot b_n = 0$. For a natural number $ngeq1$, a Fibonacci cube of dimension $n$ is denoted by $Gamma_n$ and i...
متن کاملA generalized upper bound solution for bimetallic rod extrusion through arbitrarily curved dies
In this paper, an upper bound approach is used to analyze the extrusion process of bimetallic rods through arbitrarily curved dies. Based on a spherical velocity field, internal, shearing and frictional power terms are calculated. The developed upper bound solution is used for calculating the extrusion force for two types of die shapes: a conical die as a linear die profile and a streamlined di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005